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This Preliminary Amendment is submitted to improve the form of the English translation 
as filed. It is respectfully requested that this Preliminary Amendment be entered in the above- 
referenced application. 

In accordance with the foregoing, claims 1-10 have been canceled and claims 11-20 
have been added. Thus, claims 1 1-20 are pending and are under consideration. 

A substitute specification is also being. filed herewith. The substitute specification is 
accompanied by a marked-up copy of the original specification. 

If there are any questions regarding these matters, such questions can be addressed by 
telephone to the undersigned. Othenn/ise, an early action on the merits is respectfully solicited. 

If any further fees are required in connection with the filing of this Preliminary 
Amendment, please charge same to our Deposit Account No. 19-3935. 



Respectfully submitted, 



STAAS & HALSEY LLP 



Date: 





Richard A. Gollhofer 
Registration No. 31,106 



1201 New York Ave, N.W., Suite 700 
Washington, D.C. 20005 
Telephone: (202)434-1500 
Facsimile: (202)434-1501 
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MARKED-UP SUBSTITUTE SPECIFICATION 

Doscr i Dtion T ITLE OF THE INVENTION 

SELECTION OF TH6-USER LANGUAGE ON A 
PURELY ACOUSTICALLY CONTROLLED TELEPHONE 

CROSS REFERENCE TO RELATED APPLICATIONS 

fOOOn This application is based on and hereby claims priority to German Application No. 
10256935.5 filed on December 5. 2002, the contents of which are hereby incorporated by 
reference. 

BACKGROUND OF THE INVENTION 

[0002] In communication and information equipment, text information is displayed in the 
language specified by the country version. Accompanying this, there is the facility for the user 
to set the language required as the user language or operator language. If - for whatever 
reason - the language of the user interface is now altered, the user faces the problem of 
resetting the user language required without the option of being guided to the relevant menu 
entry or control status by feedback in text form. 

[0003] This problem is a general one and is not restricted to graphical user interfaces with 
keyboard or mouse input. On the contrary, there will in future be more and more terminal 
devices which are operated purely acoustically. The problem is also faced at call centers which 
are operated purely acoustically. Here, speech, input is effected via speech recognition and 
speech output either through the playing of preproduced speech recordings or through 
automated speech synthesis in the form of a text-to-speech conversion. 

[0004] In devices with a screen input or display input and keyboard input, the following 
procedure is found for solving the problem shown: in general, there is the facility for resetting 
the device to the factory language setting. This is usually carried out by m e an s of a defined key 
combination. There are also devices in which a language menu can be activated in a simple 
manner, the user being able to select the target language. This then looks approximately as 
follows: 
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Deutsch 
Frangais 
, English 
YKpaiHeub 
(Ukrainian) 
Romanesc 
(Romanian) 

Table 1 

[0005] In this menu, the user can now select the required user language to be set. Such a 
procedure is of course not possible for purely acoustically controlled devices. 

SUMMARY OF THE INVENTION 

[0006] From this starting point, the-an obiect of the invention is to enable the selection of the 
user language of a device by m e ans of a purely acoustic method. The selection facility is also 
designed to be available in particular in! cases where the device cannot, or is not intended to, 
provide assistance through a display. Th i s obj e ct i s ach ie v e d i n th e i nv e nt i ons sp e c i f ie d in th e 
ind e p e nd e nt c l aims. Advantag e ous e mbod i m e nt s w ill e m e rg e from the s ub - c l a i m s . By mean s 
of the i nvention, tho 

[0007] The user language to be set for a device can easily be set, simply by speaking the 
user language to be set in order to select the user language. An English person therefore says 
"English", a German person simply says "Deutsch", a Frenchman says Trangais" and a 
Ukrainian says "Ukrajins'kyj" (English transliteration of "Ukrainian" in Polish script). 

[0008] The implementation of this functionality in the speech recognition means -unit of the 
device is no trivial matter, which is why preferred options will be described in greater detail 
below. 

[0009] One option consist s in is,training a single-word recognizer to recognize the designa- 
tions of the user languages which can be set; Since the algorithms used here are chiefly based 
on a simple pattern comparison, a sufficient, number of speech recordings in which the speech 
of mother-tongue speakers is recorded in relation to the relevant language is needed for the 
training. A dynamic-time-warp (DTW) recognizer, in particular, can be used for this. 
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[0010] If the device should already have phoneme-based speech recognition, for example for 
other functionalities, then it Is advantageous to employ this for setting the user interface 
language. There are three options for doing this. 

[0011] For example, a multilingual Hidden Markov Model (HMM) which models the 
phonemes of all the languages can be used in. the speech recognition-Rwafl s unit . A 
standardized representation of a phonetic alphabet, for example in the form of SAMPA 
phonemes, is particularly advantageous for this purpose. 

[0012] As convincing as this approach is for the problem definition outlined, multilingual 
speech recognition meafts- techniques have in practice shown themselves to be inferior to 
language-specific modeling in terms of their recognition rate. A further acoustic model, which 
would use up further memory space, would therefore be needed for normal speech recognition 
in the device. 

[0013] A different option, in which the phoneme sequences from the HMMs, which phoneme 
sequences are associated with the designations of the user languages to be set, are combined 
for the different languages, therefore proves to be advantageous. It must, however, be borne in 
mind here that the degrees of match which the speech recognition system delivers for the words 
modeled in different phoneme inventories are not/directly comparable with one another. This 
problem can be circumvented if, in the combined HMM, the degrees of match for the phoneme 
sequences from the different recognizable user languages are scaled. 

[0014] A particularly clever option is produced if, instead of one multilingual HMM or the 
combination of phoneme sequences of several language-specific HMMs, only one single 
language-specific or country-specific HMM is used and at the same time the designations of the 
foreign user languages are modeled using the language-specific phoneme set. The example 
below for German, which is based on the menu in Table 1 , serves as an explanation of this. The 
word models are in "phonetic" orthography: 

/ d eu t sh / 
/ f r o ng s ae / 
/i ng 1 i sh / 
/u k r ai n sk i j / 
/ r o m a n e' sh t sh / 
Table 2 
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[0015] Here, the need to use a multilingual HMM or to combine phoneme sequences having 
different phoneme inventories in the recognition process does not apply. 

[0016] In accordance with the introductory definition of the problem, the device is in particular 
a mobile terminal in the form of a mobile or cordless telephone, a headset or the server of a call 
center. 

[0017] Preferred embodiments of the method according to the invention will emerge in the 
same way as the preferred embodiments of the inventive device shown. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[001 8] Furth e r e ssontial foatur e s T hese and other objects and advantages of the present 
invention will e m e ro e become more apparent and more readily appreciated f rom the following 
description of an embodimen t, taken in conjunction w ith r e f e r e nc e to t he accompanying drawing 
m-of.which: 

Figure 1 shows -is a flowchart of t he procedure for setting the user language. 
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 

[0019] Reference will now be made in detail to the preferred embodiments of the present 
invention, examples of which are illustrated in the accompanying drawings, wherein like 
reference numerals refer to like elements throughout. 

[0020] The device can be implemented in the form of a cordless headset which is controlled 
exclusively via speech. This may for example be a headset which establishes, with or without 
cable, a connection to a base via Bluetooth, Dect, GSM, UMTS, GAP or another transmission 
standard. / 

[0021] The headset has an on/off button ahd a so-called "P2r (push-to-talk) button, by 
m e ans of which the audio channel is switched for a defined time window to the-speech 
recognition-meafis unit . The command control of the headset includes the brief pressing of the 
P2T button, an acknowledgment of the pressing of the button by a short beep and the 
subsequent speaking of the required command, to which the device responds accordingly. 
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[0022] When the device is first switched on (step 1) or after resetting of the device (step 2). 
which is caused, for example, by holding down the P2T button for a longer period, the user 
initially finds him-/herself at the user-langusige selection stage. This is communicated to the 
user by an acoustic signal (step 3) which cons i sts , for example, of-a longer beep or a 
multilingual request to speak the user language to be set. 

[0023] The user then speaks into the device, in the language to be set, the designation of the 
language to be set (step 4). The speech recognition meaRS -unit of the device then recognizes 
the designation of the user language to be set spoken in the user language to be set, provided 
that the user language to be set is one of the several user languages settable for the device. 
The user language setting meafis -unit of the device then sets the user language of the device to 
the user language recognized by the speech recognition-mean s unit , as a result of which the 
device is initialized appropriately. The device can then be operated (step 6) as if it had been 
switched on normally (step 5). 

[0024] Tried and tested means and methods from the prior art can be used to correct speech 
recognition and operating errors. 

[0025] All the embodiments of the invention share the outstanding advantage that they 
significantly simplify and speed up operation of the device. Furthermore, where phoneme- 
based recognition is used, there is no need for speech recordings to be stored in the device. 
Optimal use is made here of the fact that phoneme-based acoustic resources are already 
present in the device. 

[0026] The invention has been described in detail with particular reference to preferred 
embodiments thereof and examples, but it will be understood that variations and modifications 
can be effected within the spirit and scope of the invention covered bv the claims which may 
include the phrase "at least one of A. B and C" as an alternative expression that means one or 
more of A. B and C may be used, contrary to the holding in Superauide v. DIRECTV, 
69 USPQ2d 1865 (Fed. Cir. 2004). 
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